ReproHack Hub

Browse ReproHack papers

What do analyses of city size distributions have in common?

Authors: Clémentine Cottineau

DOI: 10.1007/s11192-021-04256-8

Submitted by clementinecottineau
Mean reproducibility score: 8.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
This article was meant to be entirely reproducible, with the data and code published alongside the article. It is however not embedded within a container (e.g. Docker). Will it past the reproducibility test tomorrow? next year? I'm curious.

Tags: R Meta-analysis science of science Zipf networks city size distribution urbanism literature review
Measuring the impact of COVID-19 vaccine misinformation on vaccination intent in the UK and USA

Authors: Sahil Loomba, Alexandre de Figueiredo, Simon J. Piatek, Kristen de Graaf, Heidi J. Larson

DOI: 10.1038/s41562-021-01056-1

Submitted by samuelpawel
Mean reproducibility score: 7.0/10 | Number of reviews: 4
Why should we attempt to reproduce this paper?
In the middle of the COVID-19 pandemic, this paper provided important evidence regarding the effect of misinformation on vaccination intent. Its analyses and conclusions were extremely important for decision makers. Therefore, it is also important that the analyses are reproducible.

Tags: Python Jupyter Notebook Stan
Droplet impact onto a spring-supported plate: analysis and simulations

Authors: Michael J. Negus, Matthew R. Moore, James M. Oliver, Radu Cimpeanu

DOI: https://doi.org/10.1007/s10665-021-10107-5

Submitted by MNegus
Mean reproducibility score: 8.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
The direct numerical simulations (DNS) for this paper were conducted using Basilisk (http://basilisk.fr/). As Basilisk is a free software program written in C, it can be readily installed on any Linux machine, and it should be straightforward to then run the driver code to re-produce the DNS from this paper. Given this, the numerical solutions presented in this paper are a result of many high-fidelity simulations, which each took approximately 24 CPU hours running between 4 to 8 cores. Hence the difficulty in reproducing the results should mainly be in the amount of computational resources it would take, so HPC resources will be required. The DNS in this paper were used to validate the presented analytical solutions, as well as extend the results to a longer timescale. Reproducing these numerical results will build confidence in these results, ensuring that they are independent of the system architecture they were produced on.

Tags: HPC C CFD Fluid Dynamics DNS Mathematics Droplets Basilisk
Accelerating the prediction of large carbon clusters via structure search: Evaluation of machine-learning and classical potentials

Authors: Bora Karasulu, Jean-Marc Leyssale, Patrick Rowe, Cedric Weber, Carla de Tomas

DOI: 10.1016/j.carbon.2022.01.031

Submitted by bkarasulu
Number of reviews: 1
Why should we attempt to reproduce this paper?
This paper presents a fine example of high-throughput computational materials screening studies, mainly focusing on the carbon nanoclusters of different sizes. In the paper, a set of diverse empirical and machine-learned interatomic potentials, which are commonly used to simulate carbonaceous materials, is benchmarked against the higher-level density functional theory (DFT) data, using a range of diverse structural features as the comparison criteria. Trying to reproduce the data presented here (even if you only consider a subset of the interaction potentials) will help you devise an understanding as to how you could approach a high-throughput structure prediction problem. Even though we concentrate here on isolated/finite nanoclusters, AIRSS (and other similar approaches like USPEX, CALYPSO, GMIN, etc.,) can also be used to predict crystal structures of different class of materials with applications in energy storage, catalysis, hydrogen storage, and so on.

Tags: Python HPC LAMMPS DFT interatomic potentials Python scripting AIRSS structure prediction density functional theory high-throughput machine-learning
Optimizing the Use of Carbonate Standards to Minimize Uncertainties in Clumped Isotope Data

Authors: Ilja J. Kocken, Inigo A. Müller, Martin Ziegler

DOI: 10.1029/2019GC008545

Submitted by japhir

Why should we attempt to reproduce this paper?
Even though the approach in the paper focuses on a specific measurement (clumped isotopes) and how to optimize which and how many standards we use, I hope that the problem is general enough that insight can translate to any kind of measurement that relies on machine calibration. I've committed to writing a literate program (plain text interspersed with code chunks) to explain what is going on and to make the simulations one step at a time. I really hope that this is understandable to future collaborators and scientists in my field, but I have not had any code review internally and I also didn't receive any feedback on it from the reviewers. I would love to see if what in my mind represents "reproducible code" is actually reproducible, and to learn what I can improve for future projects!

Tags: R tidyverse emacs literate earth sciences clumped isotopes org-mode geology
The viewing angle in AGN SED models, a data-driven analysis

Authors: Andrés Felipe Ramos Padilla, Lingyu Wang, Katarzyna Małek, Andreas Efstathiou, Guang Yang

Submitted by aframosp
Mean reproducibility score: 9.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
Most of the material is available through Jupyter notebooks in GitHub, and it should be easy to reproduce with the help of Binder. With the notebooks, you could experiment with different parameters to the ones analyzed in the paper. It also contains a large dataset of physical parameters of galaxies analysed in this work. We expect this work to be easily reproducible in the steps described in the repository.

Tags: Python Galaxies Astronomy HPC Databases Binder
pyKNEEr: An image analysis workflow for open and reproducible research on femoral knee cartilage

Authors: Bonaretti S, Gold GE, Beaupre GS

DOI: 10.1371/journal.pone.0226501

Submitted by hub-admin
Mean reproducibility score: 6.5/10 | Number of reviews: 2
Why should we attempt to reproduce this paper?
The paper describes pyKNEEr, a python package for open and reproducible research on femoral knee cartilage using Jupyter notebooks as a user interface. I created this paper with the specific intent to make both the workflows it describes and the paper itself open and reproducible, following guidelines from authorities in the field. Therefore, two things in the paper can be reproduced: 1) workflow results: Table 2 contains links to all the Jupyter notebooks used to calculate the results. Computations are long and might require a server, so if you want to run them locally, I recommend using only 2 or 3 images as inputs for the computations. Also, the paper should be sufficient, but if you need further introductory info, there are a documentation website: https://sbonaretti.github.io/pyKNEEr/ and a "how to" video: https://youtu.be/7WPf5KFtYi8 2) paper graphs: In the captions of figures 1, 4, and 5 you can find links to data repository, code (a Jupyter notebook), and the computational environment (binder) to fully reproduce the graph. These computations can be easily run locally and require a few seconds. All Jupyter notebooks automatically download data from Zenodo and provide dependencies, which should make reproducibility easier.

Tags: Python R Jupyter Notebook

Search for papers

Filter by tags

Python R GDAL GEOS GIS Shiny PROJ Galaxies Astronomy HPC Databases Binder Social Science Stata make Computer Science Jupyter Notebook tidyverse emacs literate earth sciences clumped isotopes org-mode geology eyetracking LaTeX Git ArcGIS Docker Drake SVN knitr C Matlab Mathematica Meta-analysis swig miniconda tensorflow keras Pandas SQL neuroscience robotics deep learning planner reiforcement learning Plasma physics Hybrid-PIC EPOCH Laser Gamma-ray X-ray radiation Petawatt Fortran plasma PIC physics Monte Carlo Atomistic Simulation LAMMPS Electron Transport DFT descriptors interatomic potentials machine learning Molecular Dynamics Python scripting AIRSS structure prediction density functional theory high-throughput machine-learning RNA bioinformatics CFD Fluid Dynamics OpenFOAM C++ DNS Mathematics Droplets Basilisk Particle-In-Cell psychology Stan Finance SAS Replication crisis Economics Malaria consumer behavior number estimation mental arithmetic psychophysics Archaeology Precipitation Epidemiology Parkrun Health Health Economics HTA plumber science of science Zipf networks city size distribution urbanism literature review Preference Visual Questionnaire Mann-Whitney Correlation Conceptual replication Cognitive psychology Multinomial processing tree (MPT) modeling #urbanism #R k-means cluster analysis city-regions Urban Knowledge Systems Topic modelling Planning Support Systems Software Citation Quarto snakemake Numerical modelling Ocean climate physical oceanography apptainer oceanography All tags Clear tags

Key

Associated with an event
Available for general review
Public reviews welcome

Papers

Browse ReproHack papers

What do analyses of city size distributions have in common?

Authors: Clémentine Cottineau

DOI: 10.1007/s11192-021-04256-8

Submitted by clementinecottineau

Measuring the impact of COVID-19 vaccine misinformation on vaccination intent in the UK and USA

Authors: Sahil Loomba, Alexandre de Figueiredo, Simon J. Piatek, Kristen de Graaf, Heidi J. Larson

DOI: 10.1038/s41562-021-01056-1

Submitted by samuelpawel

Droplet impact onto a spring-supported plate: analysis and simulations

Authors: Michael J. Negus, Matthew R. Moore, James M. Oliver, Radu Cimpeanu

DOI: https://doi.org/10.1007/s10665-021-10107-5

Submitted by MNegus

Accelerating the prediction of large carbon clusters via structure search: Evaluation of machine-learning and classical potentials

Authors: Bora Karasulu, Jean-Marc Leyssale, Patrick Rowe, Cedric Weber, Carla de Tomas

DOI: 10.1016/j.carbon.2022.01.031

Submitted by bkarasulu

Optimizing the Use of Carbonate Standards to Minimize Uncertainties in Clumped Isotope Data

Authors: Ilja J. Kocken, Inigo A. Müller, Martin Ziegler

DOI: 10.1029/2019GC008545

Submitted by japhir

The viewing angle in AGN SED models, a data-driven analysis

Authors: Andrés Felipe Ramos Padilla, Lingyu Wang, Katarzyna Małek, Andreas Efstathiou, Guang Yang

Submitted by aframosp

pyKNEEr: An image analysis workflow for open and reproducible research on femoral knee cartilage

Authors: Bonaretti S, Gold GE, Beaupre GS

DOI: 10.1371/journal.pone.0226501

Submitted by hub-admin

Search for papers

Filter by tags

Key